MFC: Mining Maximal Frequent Dense Subgraphs without Candidate Maintenance in Imbalanced PPI Networks
Authors
Abstract
The prediction of protein function is one of the most challenging problems in bioinformatics. Several studies have shown that prediction based on protein-protein interaction (PPI) data is promising. However, PPI data generated from high-throughput experiments are very noisy, which poses great challenges for existing methods. In this paper, we propose an algorithm, MFC, to efficiently mine maximal frequent dense subgraphs without candidate maintenance in PPI networks. Instead of using a summary graph, MFC produces frequent dense patterns by extending vertices, and it adopts several techniques to make the mining efficient. Because PPI networks are imbalanced, we also propose generating frequent patterns using relative support. We evaluate our approach on four PPI data sets. The experimental results show that our approach performs well in terms of efficiency. With the help of relative support, more frequent dense functional interaction patterns in the PPI networks can be identified.
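The abstract does not define density or support precisely, so the following is only a minimal sketch under assumed definitions, not the MFC algorithm itself. It assumes a vertex set is "dense" in a network when the edge density of its induced subgraph reaches a threshold `gamma`, and that its support is the number of networks in which it is dense. The names `density`, `support`, `make_graph`, and `gamma` are hypothetical, and the paper's relative support is not modeled here because the abstract does not specify it.

```python
from itertools import combinations


def make_graph(pairs):
    """Store an undirected graph as a set of frozenset edges (assumed representation)."""
    return {frozenset(p) for p in pairs}


def density(edges, vertices):
    """Edge density of the subgraph induced by `vertices` (assumed definition)."""
    vs = sorted(vertices)
    if len(vs) < 2:
        return 0.0
    present = sum(1 for u, v in combinations(vs, 2) if frozenset((u, v)) in edges)
    possible = len(vs) * (len(vs) - 1) // 2
    return present / possible


def support(graphs, vertices, gamma):
    """Number of graphs in which `vertices` induces a subgraph with density >= gamma."""
    return sum(1 for edges in graphs if density(edges, vertices) >= gamma)


# Three toy networks over the same (hypothetical) protein labels.
graphs = [
    make_graph([("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]),
    make_graph([("A", "B"), ("B", "C"), ("C", "D")]),
    make_graph([("A", "B"), ("A", "C"), ("B", "C")]),
]
pattern = {"A", "B", "C"}

print(support(graphs, pattern, gamma=0.8))  # dense in the first and third network
print(support(graphs, pattern, gamma=0.6))  # the second network (density 2/3) now counts too
```

Under these assumptions, a pattern would be called frequent when its support reaches a user-chosen minimum; how MFC extends vertices and avoids maintaining candidates is described in the paper, not in this sketch.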
Similar resources
A Parallel Algorithm for Mining Maximal Cohesive Subgraphs
Robust and scalable techniques for mining patterns or subgraphs in protein-protein interaction (PPI) networks can help identify functionally relevant and coherent subnetworks. Recently, researchers have focused on integrating gene attributes with the protein-protein interaction networks for mining connected subnetworks whose genes are similar in a subset of attributes. However, most of the pro...
k-Partite cliques of protein interactions: A novel subgraph topology for functional coherence analysis on PPI networks.
Many studies are aimed at identifying dense clusters/subgraphs from protein-protein interaction (PPI) networks for protein function prediction. However, the prediction performance based on the dense clusters is actually worse than a simple guilt-by-association method using neighbor counting ideas. This indicates that the local topological structures and properties of PPI networks are still open...
Complexes Detection in Biological Networks via Diversified Dense Subgraphs Mining
Protein-protein interaction (PPI) networks, providing a comprehensive landscape of protein interacting patterns, enable us to explore biological processes and cellular components at multiple resolutions. For a biological process, a number of proteins need to work together to perform the job. Proteins densely interact with each other, forming large molecular machines or cellular building blocks....
FP-GraphMiner-A Fast Frequent Pattern Mining Algorithm for Network Graphs
In recent years, graph representations have been used extensively for modelling complicated structural information, such as circuits, images, molecular structures, biological networks, weblogs, XML documents and so on. As a result, frequent subgraph mining has become an important subfield of graph mining. This paper presents a novel Frequent Pattern Graph Mining algorithm, FP-GraphMiner, that c...
A survey of frequent subgraph mining algorithms
Graph mining is an important research area within the domain of data mining. The field of study concentrates on the identification of frequent subgraphs within graph data sets. The research goals are directed at: (i) effective mechanisms for generating candidate subgraphs (without generating duplicates) and (ii) how best to process the generated candidate subgraphs so as to identify the desired...
Journal title:
- JSW
Volume 6, Issue
Pages -
Publication date: 2011